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Directed paths: from Ramsey to Ruzsa and Szemeredi 


Po-Shen Loh * 


Abstract 

Starting from an innocent Ramsey-theoretic question regarding directed paths in tourna¬ 
ments, we discover a series of rich and surprising connections that lead into the theory around a 
fundamental problem in Combinatorics: the Ruzsa-Szemeredi induced matching problem. Using 
these relationships, we prove that every coloring of the edges of the transitive n-vertex tourna¬ 
ment using three colors contains a directed path of length at least ^/n ■ " which entirely 

avoids some color. We also expose connections to a family of constructions for Ramsey tourna¬ 
ments, and introduce and resolve some natural generalizations of the Ruzsa-Szemeredi problem 
which we encounter through our investigation. 


1 Introduction 

Ramsey theory is a central area in Combinatorics which concerns structure that must exist in arbi¬ 
trary (but large) configurations. The classical starting point for Graph Ramsey Theory considers 
the number of vertices required in order for every 2-edge-coloring of a sufficiently large complete 
graph to contain a monochromatic copy of a hxed complete graph Kt da [25]. Over the century, 
many other target subgraphs were considered, from the specific, such as paths [ISIj to trees and 
cliques [7|, to entire classes such as graphs with bounded degree [HllinillTI and degeneracy [S]- 

This paper explores an interesting Ramsey-type problem about paths, and so we briefly outline 
some of the history. In undirected graphs, an old result of Gerencser and Gyarfas m established 
that the corresponding 2-color Ramsey number of an n-vertex path is precisely . When the 

number of colors increases beyond two, however, less is known. The result for three colors was 
proved asymptotically by Figaj and Luczak m, and recently for large n by Gyarfas, Ruszinko, 
Sarkozy, and Szemeredi m- 

Another flavor of Ramsey-type results, which also inspires the work in this paper, stems from the 
following celebrated theorem of Erdos and Szekeres d2| : every sequence of n distinct real numbers 
contains a monotone subsequence of length at least y/n. This is often proven by the pigeonhole 
principle. Indeed, assign to each number an ordered pair {x,y), where x tracks the length of the 
longest increasing subsequence ending at that number, and y tracks the decreasing analogue. Since 
the numbers are distinct, it is easy to see that the ordered pairs must also be distinct, and therefore 
some ordered pair has an element at least ^/n. Many extensions have been found for this theorem 
(see, e.g., any of [IS j^Hl (Ml EZl I2S|)- 

The same pigeonhole argument also proves the following straightforward result. Consider the 
transitive tournament on the vertex set {1,... ,n}, where the edge between i < j is oriented in 
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the direction ij. Then, every 2-coloring of the edges of has a monochromatic (directed) path 
with at least ^/n vertices (hereafter called the vertex-length). The generalization to monochromatic 
paths in with r-colored tournaments (with sharp lower bound was independently rediscovered 
multiple times [HEiiiH]. This problem can also be restated in terms of the ordered Ramsey number 
of a path; see, e.g., any of [a El na i23i HU for a discussion of recent work in ordered Ramsey 
Theory, several of which consider paths (in the context of hypergraphs). 

This paper discusses several questions that start from this classical argument, leading to sur¬ 
prising and interesting connections across Combinatorics. This serendipitous tour passes through 
directed graph Ramsey theory, to /c-majority tournaments (applied to create constructions for Ram¬ 
sey tournaments), and ultimately to the borders of a foundational problem of Ruzsa and Szemeredi 
on induced matchings. 

Indeed, with two colors, the concept of “monochromatic” is unambiguous. With three or more 
colors, however, there is another natural generalization of the basic concept: one can seek substruc¬ 
tures which are 1-color-avoiding, i.e., in which there is at least one color which does not appear at 
all. (In the 2-color setting, these two concepts are identical.) Already in the 3-color setting, this 
problem is not well-understood. 


Problem 1.1. Determine f{n), the maximum number such that every 3-coloring of the edges of 
the n-vertex transitive tournament contains a directed path with at least f{n) vertices, which avoids 
at least one of the colors. 


By merging two of the three color classes, one may obtain a bound /(n) > ^/n using the 2-color 
result, and the following standard construction achieves f{n) < when n is a perfect cube: 
identify the vertex set with the triples {!,...,n}^, ordered lexicographically, and color an edge 
{x,y,z) —)> {x',y',z') based upon the leftmost position at which the triples differ. The best result, 
however, is still unknown. 

It turns out that there are interesting connections between this problem and a longstanding 
open question which traces back to a result of Ruzsa and Szemeredi |26) on induced matchings, 
which has rich connections to Szemeredi’s Regularity Lemma (see, e.g., the survey |21ji. 


Problem 1.2. Let M{n,k) be the maximum number of edges in an n-vertex graph G = {V,E) 
whose edge set is the union of k induced matchings. That is, E = Ei L) ... L) Ej., where each Ei 
is precisely the set of edges induced by ^[V)] for some subset Vi C V, and each Ei is a matching. 
Determine the asymptotics of M(n, k). 


Leveraging this connection, and using the current best bounds on Problem 1.2 (due to Fox’s 
result on the Triangle Removal Lemma), we obtain a non-trivial lower bound on f{n). Here, 
log* is the iterated logarithm, or the inverse of the tower function T{n) = T{0) = 1. Also, 

we write f{n) = Ll{g{n)) to mean that there exists a positive constant c such that /(n) > cg{n) for 
all sufficiently large n. 


Theorem 1.1. Every 3-coloring of the edges of the n-vertex transitive tournament contains a 
directed path of length at least Ll{y/n ■ ”), which is missing at least one color. 


Furthermore, we discover that Problem 1.1 is related to an “ordered” variant of the following 
generalization of the Ruzsa-Szemeredi problem. 


Problem 1.3. Let Mi(n,k) be the maximum number of edges in a n-vertex bipartite graph G = 
iy, E) whose edge set is the union ofk matchings which are l-separated. That is, E = EiL).. .DE^, 
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where each Ei is a matching, and there is no sequence of vertices ao,ai,... ,at with t <1, such that 
ao and at are both incident to distinct edges from the same Ei, and each of a^ai, aia 2 , ■ ■ ■, at-iat 
are edges of E. Determine the asymptotics of Mi{n,k). 


Note that when 1 = 1, this corresponds to the induced matchings from Problem |1.2[ In our 
application, we are most interested in the symmetric bipartite case, which corresponds to M;(2n, n). 
We completely resolve the unordered version stated above. It turns out that already for I = 2, the 
exponent immediately drops, and remains level for all higher I, so that the unique behavior is 
special to / = 1. 

Theorem 1.2. For all positive integers I > 2 and n, we have Mi{2n,n) = (1 + o(l))n^/^. 


Finally, we sh ow th at the same bound on the ordered variant (formulated in Section 2.3) would 
to achieve the asymptotically sharp bound /(n) > 


improve Theorem 


1.1 


2 Analysis 

We start with directed paths that avoid at least one color. In this setting, the result is not even 
known for transitive tournaments, and so we focus on that case here. Recall that f{N) is defined 
as the maximum number n such that every 3-coloring of the edges of the transitive Wvertex 
tournament contains a directed path of vertex-length at least n, which avoids at least one of the 
colors. 

2.1 Reformulation and fc-majority tournaments 

We begin by stating an equivalent and independently natural problem. The following statement is 
more convenient for relating with fe-majority tournaments, and this connection will be detailed at 
the end of this section. 

Problem 2.1. Determine E{n), the maximum number of triples in a sequence Li, L 2 , ■ ■ ■ such that 
every coordinate is an integer between 1 and n inclusive, and for every i < j, there are at least two 
coordinates in which Li is strictly less than Lj. 

Example. The sequence (3,2,7), (7,3,2), (1,8,8), (2,9,9), ... is valid. Indeed, when comparing 
the second and fourth triples, note that although the first coordinate decreases from 7 to 2, the 
second and third coordinates both increase, from 3 to 8 and 2 to 8, respectively. 

It turns out that Problem o (Ramsey for 1-color-avoiding paths) and Problem 
completely equivalent, as established by the following lemma. 

Lemma 2.1. For any positive integers n and N: 

(i) if f{N) < n, then F{n) > N; and 

(ii) if f{N) > n, then F{n) < N; and thus 

(iii) the values of F are precisely the arguments at which f increases: F{n) = N if and only if 
f{N) = n and f{N + 1) > n. 


2.1 (above) are 
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Proof. For part (i), f{N) < n means that there is a 3-coloring of the edges of the transitive 
tournament on vertex set {1,.. -N}, with all 1-color-avoiding paths having vertex-length at most 
n. Consider this coloring, and follow the Erdos-Szekeres proof. Associate a triple of numbers to 
each vertex, with Lk = {xk,yk,Zk) corresponding to the /c-th vertex, where Xk is the number of 
vertices in the longest path that ends at the k-th vertex and avoids the first color, and yk and Zk 
are the corresponding statistics for avoiding the second and third colors, respectively. 

Consider any two triples Lj and Lj with i < j. Since there are three colors overall, there are 
two colors which differ from the color of edge ij. Let c be one of then^ Then, the longest path 
which avoids color c and ends at the i-th vertex can be extended by edge ij , to end at j. Therefore, 
the corresponding coordinate (for color c) of Lj must be strictly less than that of Lj. This holds 
also for the other color which differs from ij, and so Lj is strictly less than Lj in at least two 
coordinates. We have verified the necessary properties produce a valid construction for Problem 


2.1 with N triples, all of whose entries are at most n. This implies F{n) > N. 

To prove (ii), fix an arbitrary sequence of triples Li, L 2 ,..., Ljv with the property that for every 
i < j, there are at least two coordinates in which Lj is strictly less than Lj, and all coordinates are 
positive integers. We will show that some triple contains an integer strictly greater than n. Indeed, 
let us construct a 3-edge-coloring of a transitive tournament on vertex set {1,..., A^}, where edges 
are directed in the natural forward increasing ordering. For each pair i < j, there is always a choice 

of one of the three coordinates such that Lj is strictly less than Lj in the other two coordinates. (If 

Lj is strictly less than Lj in all three coordinates, then there are three choices for this coordinate; 
disambiguate by selecting the first coordinate.) Use this coordinate as the color of the edge. Since 
f{N) > n by assumption, there is a I-color-avoiding directed path with more than n vertices. 

At the same time, the following dynamic programming algorithm computes the length of the 
longest I-color-avoiding directed path in a transitive tournament. Iteratively construct triples 
Ti,..., Tjv, where each Tk = {xk,yk, Zk) records the vertex-lengths of the longest I-color-avoiding 
paths ending at the k-th vertex, avoiding the first, second, and third color, respectively. This can 
be achieved by starting with Ti = (I, I, I), and at each step, given Ti,... ,Tk-i, constructing Tk 
by setting 

Xk = 1 + max{xj : I < j < k and jt: not in color 1} 

yk = 1 maxjyj : 1 < j < k and 'jt: not in color 2} 

Zk = t + max{zj : 1 < j < k and jt: not in color 3}, 


with the convention that the maximum is taken to be 0 if it is taken over the empty set. (This is 
actually consistent with the definition of Ti.) 

By construction of the tournament coloring, we inductively have that for each k, the triple Tk 
is less than or equal to the given triple Lk in every coordinate. Since there is a I-color-avoiding 
directed path with more than n vertices, this implies that some entry of some Lk exceeds n, as 
claimed. 

The hnal statement (iii) hinges on two observations: the sequence /(I), /(2), /(3),... is mono¬ 
tonic, and consecutive terms increase by at most one. To see that the latter is true, consider an 
arbitrary 3-edge-coloring of the transitive A^-vertex tournament, and extend it to a 3-edge-coloring 
of the transitive {N -|- I)-vertex tournament by coloring all edges to the {N + I)-st vertex in the 
first color. This would then contain a I-color-avoiding path of vertex-length at least f{N + I), 
which interacts with the (A^ -|- I)-st vertex in its final vertex, if at all. Therefore, the original 
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3-edge-coloring of the A^-vertex tournament contained a 1-color-avoiding path of vertex-length at 
least f{N -|- 1) — 1, hence f{N) > f{N -|- 1) — 1, as claimed. 

Now suppose that f{N) = n and f{N -|- 1) > n. By (1), we have F{n) > N, and by (11), we 
have F(n) < iV -|- 1. Since F(n) is an integer, this implies F(n) = N. Furthermore, since we have 
established that /(I), /(2),... is monotonic with consecutive terms increasing by at most one, and 
we also know that /(I) = 1 and the sequence is unbounded, this determines F(n) for every positive 
integer n. Since each F{n) can only take one value, this establishes both directions of the hnal 
if-and-only-if statement. □ 


Problem 2.1 is independently interesting in the above formulation, as it connects to the area of 
/c-majority tournaments. 


Definition 2.1. Given a set of k linear orders on a common ground set X (with k odd), the 
corresponding k-majority tournament has vertex set X, with an edge directed from x to y if and 
only if a majority of the k orders rank x < y. 


These types of tournaments have been actively studied in social science [22] , as well as in Com¬ 
binatorics, where previous researchers studied extremal problems regarding the sizes of dominating 
sets |2| and transitive subtournaments [23]. Problem 2.1 is connected to bounding transitive sub¬ 
tournaments in a natural class of constructions of fc-majority tournaments. Indeed, the author 
actually came upon the study of this problem in the course of analyzing the following construction, 
which may potentially be of independent interest. 


Construction 2.1. Let n and k he positive integers, and letai,..., Ok, be permutations o/{l,..., k}, 
such that iTi(l) = i for each i. Define the vertex set X = {I,... ,n}^ to be the collection of k-tuples 
of integers between 1 and n inclusive. For each of the k permutations Oi, construct a linear order on 
X by taking the lexicographical order where <Tj(l) is the most significant coordinate, <Tj(2) is the next 
most significant, and so on. Finally, construct a k-majority tournament from these permutations. 


The central problem in constructing Ramsey tournaments is to avoid large transitive subtour¬ 
naments. Strictly speaking, to estimate the size of the largest transitive subtournament in this 
class of construction, one must consider all elements of each permutation, because ties need to be 
broken. However, when n is large, the main asymptotic may be dictated by transitive subtourna¬ 
ments which are entirely resolved by the first element of every permutation (with no ties). This 
type of transitive subtournament is then a sequence of A:-tuples in which every pair in the sequence 


strictly increases in a majority of the coordinates. When k = 3, this is precisely Problem 2.1 


2.2 Induced matchings and Ruzsa-Szemeredi 


In light of the previous section, it suffices to focus on Problem 2.1 which asks to upper-bound the 
lengths of sequences of triples Li,L 2 ,... from {1,2,..., n}^, in which every pair of triples increases 
in at least two coordinates. The following trivial observation implies that the sequence length is at 
most 


Observation 2.1. For each fixed x*, y*, the sequence has at most one triple of the form (x*, y* ,z). 
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To improve upon this trivial bound F(n) < we uncover a nice connection to the celebrated 
Ruzsa-Szemeredi induced matching problem by using the sequence of triples to construct the fol¬ 
lowing bipartite graph G. Let the vertex set be {ai,..., an, bi,..., bn}- For each triple {x, y, z), 
place an edge between ax and by, and label it with z. 


Lemma 2.2. For each label z G {1,..., n], the set of edges labeled z forms an induced match¬ 
ing: the subgraph of G (which includes all edges of all labels) induced by all vertices spanned by 
those edges of is a matching. 


Proof. Fix z. By Observation 2.1 for each x, there is at most one triple of the form (x,-k,z), and 
so each vertex ax is incident to at most one edge labeled z. Similarly, each vertex by is incident to 
at most one edge labeled z. Therefore, in the subgraph of G induced by all vertices spanned by 
those edges of E^, the edges of Ez form a matching. 

It remains to show that no other edges (from other labels) are present in this induced subgraph. 
Indeed, suppose for contradiction that vertices ax and by are both incident to edges labeled z, but 
the edge axby itself is labeled F 7 ^ z. This means that the triple sequence contains the following 
triples, in some order (not necessarily the order presented): 


{x,y',z) 


{x',y,z) 

{x,y,z') 

Without loss of generality, suppose that the first two triples come in the relative order presented. 
Since both share the same third coordinate 2 , but must strictly increase in at least two coordinates, 
we have 

X < x' and y' < y. (I) 

Similarly, since the first and third triples both agree in the first coordinate x, there must be a 
strict increase in both of the other two coordinates from one to the other. Since we already have 
y' < y from Q, we must therefore also have z < z'. 

Finally, since the second and third triples both agree in the second coordinate y, there must be 
a strict increase in both of the other two coordinates from one to the other. However, we already 
have x' > X and z < z', which are in opposite directions, which is a contradiction. □ 


We are now ready to prove our theorem. 

Proof of Theorem The induced matching property places us squarely in the context of Problem 


1.2 


which studies M{n, k), the maximum number of edges in an n-vertex graph whose edge set is 
the union of k induced matchings. Indeed, we immediately have F{n) < M(2n,n), because our 
bipartite graph G has exactly P{n) edges. The original result of Ruzsa and Szemeredi [M] already 


breaks below the trivial quadratic bound, and the current best Tri angl e Removal Lemma bound of 
Fox [13] gives M(2n, n) < 0(n^/e^°® ^), which, in light of Lemma 


2.1 


proves our result. 


□ 
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2.3 Ruzsa-Szemeredi generalizations 

It turns out that the bipartite graphs constructed from our triple sequences have richer properties 
beyond the induced matching phenomena. Indeed, the set of edges labeled with each fixed value z 
forms an “ordered” induced matching, as illustrated in Figure 

Observation 2.2. For each label z G n}, the edges labeled z are of the form (aq,5j^), 

(ajj, bj^), ..., with aq < ai^ < ■ ■ ■ and bj^ < bj^ < ■ ■ ■. 



Figure 1: Ordered matching. Note that the matching edges do not cross each other. 


This additional ordering property alone is insufficient to significantly improve the result, because 
Ruzsa and Szemeredi’s lower bound construction [26], based upon Behrend ’s nin] three-term 
arithmetic progression free sets, actually already produces ordered induced matchings. Their graph 
has about edges, so even if the additional order condition has an effect, it cannot improve 

the exponent on n. However, our collection of ordered induced matchings still satisfies an additional 
condition, which we call Tj-free (illustrated in Figure]^. 

Lemma 2.3. There are no vertices bh,ai,bj,ak,bi such that i < k and h < j < I, edges aibh and 
akbi are both labeled z, and edges Oibj and Ukbj are both in the graph (hut may carry labels different 
from z). 


Remark. By choosing j = / or /i = j, we recover the ordered induced matching condition, so this 
is stronger than the traditional context of Ruzsa-Szemeredi. 


Proof of Lemma 2.3. Lemma 2.2 already established that the edge labels provide a partition into 
induced matchings, so we may assume that h, j, and I are distinct. Let x be the label of the edge 
aibj. We then have two triples {i,h,z) and {i,j,x) with h < j, so since one triple must increase 
in at least two coordinates, this forces z < x as well. Let y be the label of the edge Okbj. A 
similar argument considering {i,j,x) and {k,j,y) implies that x < y. However, the final edge a^bi 
gives rise to {k,l,z), and when we compare that triple to {k,j,y), we find that they match in one 
coordinate, we already have deduced that z < x < y and j < I, so neither can exceed the other in 
two coordinates. This contradiction establishes the claim. □ 
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Figure 2: Prohibited structure in S-free matchings. The two outer edges both share the same label, 
but the connecting edges are differently labeled. 


Lemma 2.3 now motivates Problem 1.3 from the introduction, which is a natural direction of 


generalization to the Ruzsa-Szemeredi induced matching problem: rather than having two edges of 
the same matching linked by a single edge of another matching, disallow edges of the same matching 
to be linked by a path of length up to I (possibly using edges of different labels). Recall that we 
defined Mi{n, k) to be the maximum number of edges in a n-vertex bipartite graph whose edge set 
is the union of k matchings which are /-separated. 

Lemma 2.3 actually provides an “ordered” condition which is more restrictive than the for¬ 


bidden structures in Problem 1.3, so upper bounds on Mi{2n,n) do not necessarily transfer over 
to our original question. However, due to its proximity to the well-studied problem of Ruzsa and 


Szemeredi, Problem 1.3 is of independent interest. We now proceed to prove the sharp upper bound 


on M/(2n,n) for all I > 2. 


Proof of Theorem 1^. We start by proving that M2(2n,n) < Suppose that we have a (2n)- 


vertex bipartite graph which is a disjoint union of 2-separated matchings Fli,..., E^- Associate a 
set Sv of indices i to each vertex u, which records which Ei contain edges incident to v. Now, if 
there is a vertex v such that two distinct neighbors x and y are both incident to edges of the same 
Ei, then the 2-separation condition is violated. (Here we use the assumption that the graph is 
bipartite: they cannot both be incident to the same edge.) Thus, for each vertex w, its neighbors 
have disjoint Sy. 

Since each Ei is a matching, each vertex is incident to at most one edge of each Ei. Therefore, 
for every vertex, the sum of the degrees of its neighbors is at most the number of matchings n; 
summing over all 2n vertices, the number of two-edge walks in the graph is at most (2n)(n) = 2n^. 
Since this is also precisely the sum of the squares of the degrees, convexity implies 


2re^ = 


> {2n){df 


where d is the average degree. Therefore, the average degree is at most ^/n, which implies that 
there are at most ^{2n)y/n edges, and M2(2n,n) < as claimed. Since Mi{2n,n) is clearly 
monotonic in I, this then immediately implies that Mi{2n, n) < for all / > 2 as well. 









For the lower bound, we will prove that this bound is exactly sharp for every perfect square 
n through an explicit construction. Start with the disjoint union of ^/n copies of the complete 
bipartite graph Each component has exactly n edges; assign one to each Ei. Now, every 

pair of distinct edges from the same Ei actually comes from two different connected components, 
and are therefore at infinite distance in the graph. Hence, for every perfect square n and every 
/ > 2, we have Mi{2n, n) = which completes the proof. □ 


Remark. If the bound in Theorem 1 1 . 2| could be es tabl ished in the weaker S-free setting, we would 
immediately achieve F{n) < which by Lemma 2.1 would produce the sharp bound /(n) > 
on our original Ramsey question on 1-color-avoiding paths. 


3 Concluding remarks 


This study has exposed many interesting problems, ranging from Ramsey theory, to digraphs, 
and on to Regularity. The original open problem is to improve the Ramsey bounds for 1-color- 


avoiding paths in 3-edge-colorings of transitive tournaments (Problem 1.1). It is also interesting 


to consider general tournaments. There may also be interesting phenomena when more than three 
colors are involved. Indeed, there is a whole spectrum of Ramsey-type problems where the notion 
of monochromatic slides across the scale from 1-color to 1-color-avoiding. 

There are also interesting new questions which border on the area of Szemeredi’s Regularity 
Lemma. Our Theorem 1.2 handles (unordered) ^-separated matchings. It would be interesting to 
prove the analogue of the M2(2n, n) upper bound for S-free matchings (which are ordered). The 
S-free condition naturally generalizes to ordered notions of /-separation between matchings, and 
indeed, our graphs satisfy all properties along this successively richer hierarchy. Any improvement 
anywhere along this hierarchy would directly translate into improvements for our problem. 

Finally, it would be interesting to analyze the Ramsey tournaments produced by well-chosen 
collections of permutations in the class of Construction 2.1 If the transitive subtournaments are 


indeed small, this could give rise to new alternative constructions in that area of study. 
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